Extract data from a pdf file and process raw data

Task to do

This notebook will guide you to extract tabular data from a pdf file.

The the overall goal is to extract trajectory data from a pdf file and then plot wellbore trajectory of well Hibernia 1638

Data Source

The following example contains drilling trajaectory data from CNLOPB website.

CNLPOB - Canada-Newfoundland & Labrador Offshore Petroleum Borad

Hibernia 1638

We will use trajectory data from the Well Hibernia 1638. These data availabe on CNLOPB data repository.

Trajectory data available in this link https://home-cnlopb.hub.arcgis.com/pages/hibernia-b-16-38

CNLOPB data repository

You can download the required file (INV-146206.pdf) by clicking the view record

image.png

Since we made change in this dataframe, we could not made any further modification on in 'Latitude' and 'Longitude'. Therefore, we can get them again from back up data.

Let us check the back up data again. It seems that there are unwanted space between number in 'Latitude' and 'Longitude'

Now we will plot the trajectory data in a 3D plot. This will generate a 3D interactive trajectory plot.

It shows an inverse profile for TVD, therefore we will change the axis direction for TVD by change the range for Z axis